Experiences with a Multilingual Ontology-Based Lexicon for News Filtering
نویسندگان
چکیده
Ontologies as part of a lexicon are important for many NLP tasks. In this paper, we draw on experiences gained in an ESPRIT project called TREVI, which concerns news filtering and enrichment and includes a English/Spanish multilingual Lexicon. We sketch the way the Lexicon, including an ontology, is constructed, and present the methodology used for adding domain ontologies. In general, the approach is lexicon-driven in the sense that ontology (semantics) and lexicon (word forms) are developed in tan-
منابع مشابه
A Multilingual/Multimedia Lexicon Model for Ontologies
Ontology development is mostly directed at the representation of domain knowledge and much less at the representation of textual or image-based symbols for this knowledge, i.e., the multilingual and multimedia lexicon. To allow for automatic multilingual and multimedia knowledge markup, a richer representation of text and image features is needed. At present, such information is mostly missing ...
متن کاملBasic Lexicon and Shared Ontology for Multilingual Resources: A SUMO+MILO Hybrid Approach
A common conceptual infrastructure is crucial for multilingual language processing and documentation. Global Wordnet (GWN) was proposed as the common infrastructure for linguistically motivated conceptual representations for all languages. Two critical issues in this line of research are: the scarcity of lexical semantic information (especially from endangered languages), and the lack of a shar...
متن کاملEnhancing portability with multilingual ontology-based knowledge management
Information systems in multilingual environments, such as the EU, suffer from low portability and high deployment costs. In this paper we propose an ontology-based model for multilingual knowledge management in information systems. Our unique feature is a lightweight mechanism, dubbed context, that is associated with ontological concepts and specified in multiple languages. We use contexts to a...
متن کاملDAnIEL: Language Independent Character-Based News Surveillance
This study aims at developing a news surveillance system able to address multilingual web corpora. As an example of a domain where multilingual capacity is crucial, we focus on Epidemic Surveillance. This task necessitates worldwide coverage of news in order to detect new events as quickly as possible, anywhere, whatever the language it is rst reported in. In this study, text-genre is used rath...
متن کاملMultilingual Conceptual Access to Lexicon based on Shared Orthography: An ontology-driven study of Chinese and Japanese
In this paper we propose a model for conceptual access to multilingual lexicon based on shared orthography. Our proposal relies crucially on two facts: That both Chinese and Japanese conventionally use Chinese orthography in their respective writing systems, and that the Chinese orthography is anchored on a system of radical parts which encodes basic concepts. Each orthographic unit, called han...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998